首页> 外文OA文献 >DeBaCl: A Python Package for Interactive DEnsity-BAsed CLustering
【2h】

DeBaCl: A Python Package for Interactive DEnsity-BAsed CLustering

机译:DeBaCl:用于交互式DEnsity-Based CLustering的python包

摘要

The level set tree approach of Hartigan (1975) provides a probabilisticallybased and highly interpretable encoding of the clustering behavior of adataset. By representing the hierarchy of data modes as a dendrogram of thelevel sets of a density estimator, this approach offers many advantages forexploratory analysis and clustering, especially for complex andhigh-dimensional data. Several R packages exist for level set tree estimation,but their practical usefulness is limited by computational inefficiency,absence of interactive graphical capabilities and, from a theoreticalperspective, reliance on asymptotic approximations. To make it easier forpractitioners to capture the advantages of level set trees, we have written thePython package DeBaCl for DEnsity-BAsed CLustering. In this article weillustrate how DeBaCl's level set tree estimates can be used for difficultclustering tasks and interactive graphical data analysis. The package isintended to promote the practical use of level set trees through improvementsin computational efficiency and a high degree of user customization. Inaddition, the flexible algorithms implemented in DeBaCl enjoy finite sampleaccuracy, as demonstrated in recent literature on density clustering. Finally,we show the level set tree framework can be easily extended to deal withfunctional data.
机译:Hartigan(1975)的水平集树方法为数据集的聚类行为提供了一种基于概率的且可高度解释的编码。通过将数据模式的层次结构表示为密度估算器的水平集的树状图,此方法为探索性分析和聚类(尤其是复杂的高维数据)提供了许多优势。存在一些用于水平集树估计的R包,但是它们的实用性受到计算效率低,缺少交互式图形功能以及从理论角度上依赖渐近逼近的限制。为了使从业人员更容易掌握级别集树的优势,我们编写了Python软件包DeBaCl进行DEnsity-BAsed CLustering。在本文中,我们说明了DeBaCl的水平集树估计如何用于困难的聚类任务和交互式图形数据分析。该软件包旨在通过提高计算效率和高度用户自定义来促进水平集树的实际使用。另外,如最近有关密度聚类的文献所证明的那样,在DeBaCl中实现的灵活算法具有有限的样本准确性。最后,我们展示了可以轻松扩展级别集树框架以处理功能数据的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号